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resources interconnected by a computer network and com- 
prised of a plurality of limited supply resources (such as 
those associated with multimedia content servers, e.g., 
bandwidth, CPU, storage, etc,,). The system comprises a 
plurality of server resources, preferably with target multi- 
media content, real time processing engines, etc., and a 
plurality of intermediary system resources (such as directo- 
ries and resource monitors). The resources in any particular 
server are classified or partitioned as global or local, where 
the ratio of global to local is specified and controlled by the 
system. The system assigns clients across local and global 
resources and coordinates the placement of replicas of target 
content across global resources. The placement is dynamic 
and performed when necessary based on the analysis of 
utilization patterns of target content and replicas by plurali- 
ties of clients. 
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SELF-REGULATED RESOURCE 
MANAGEMENT OF DISTRIBUTED 
COMPUTER RESOURCES 



BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates generally to techniques for 
managing distributed multimedia resources on wide area 
networks characterized by large bandwidth and resource 
reservation capabilities, and, more specifically, to a system 
that provides the ability to map, reserve, monitor, and 
manage a dynamic plurality of computing and storage 
resources via a collection of arbitrators, policies and direc- 
tories. 

2. Discussion of the Prior Art 

The task of leveraging the increased availability of widely 
distributed content and resources becomes very important 
with the proliferation of the next generation of the Internet, 
e.g., Intemet2. The emerging Internet projects address the 
creation of a leading edge network for a new generation of 
applications that fully exploit the capabilities of broadband 
networks. Very high bandwidth and bandwidth reservation 
will allow materials such as continuous digital video and 
audio to move from research use to much broader use and 
include images, audio, and video in a way currently not 
possible. In such a widely distributed environment, 
accountable, efficient, and self-regulated management of 
resources will be desirable and most importantly, necessary. 

The driving force behind the movement of Internet to the 
next generation is the commercialization of rich multimedia 
content. Digital library collections produced by 
corporations, entertainment material created by movie 
studios, and interactive instructional presentations devel- 
oped by universities are soon to be available over the 
Internet, thus creating a new and broad source of revenue. 

The emerging Internet relies on the bandwidth, which is 
on the order of several magnitudes larger than current 
Internet provides. It also alleviates network resource man- 
agement and QoS control by introducing correspondent 
reservation and monitoring mechanisms. However, it is 
clear, that to date, mechanisms for the collective manage- 
ment of multiple media connections that efficiently leverage 
the sharing of resources across multiple servers in a wide 
area network are not found. 

There is envisioned three major conditions for successful 
commercialization of those newly arising applications: first, 
mechanisms need be provided to allow paying users to 
establish a contract with service providers to reserve 
required infrastructure components and resources at a mutu- 
ally agreed price for which providers establish and support 
a guaranteed quality of service; second, the resources supply 
would have to be sufficient to meet random changes of the 
demand, which may be completely unpredictable during 
architectural studies; and, third, service providers should 
safely rely on the system for effective security, rights and 
royalties management, accounting and billing for the con- 
sumption of dynamically re -configurable distributed virtual 
resources. 

The current focus of resource management in the today's 
Internet, if any, relates to the setup and management of 
individual and independent media connections to server 
resources. However, the danger of this approach becomes 
clear when the presentations reuse multiple primary sources 
of content. To enforce the necessary quality as well as to 
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control the usage and distribution when reusing multiple 
sources of content, two approaches are possible. One 
approach is to copy aU content onto a single server (or a 
cluster of servers) during authoring, and replicating, as 

5 necessary, the final result to as many servers according to 
predicted demand. Primary content providers would then 
establish copyright charges, based on a priori market analy- 
sis. On the positive side, the control of distribut ion, security, 
and billing functions become much easier, than in case of 

10 distributed content. On the negative, if the demand is 
estimated incorrectly, the profit is not maximized for either 
primary or secondary (i.e., reuse) content providers. Finally, 
the most dangerous problem is that this approach leads to 
over-engineering of resources while it does not prevent 

15 dropout of excessive requests. Such an approach is typical 
for today's Internet, because current multimedia content is 
generally not memory hungry, as compared with emerging 
multimedia applications. 
Another approach would be to reassemble content on a 

20 need basis, for both authoring and dissemination. It would 
allow content to be stored once, but used as many times as 
necessary, establish charges proportionally to content and 
resources usage, and alleviate storage demand. However, it 
requires a system to dynamically manage multiple and 

25 frequently heterogeneous resources. In addition, this 
approach exacerbates security, and resource engineering. A 
demand for the particular segment can not be predicted at 
all, because this segment may be used in completely 
different, even orthogonal applications. Now, if the demand 

30 for a single segment can not be met, multiple applications 
are affected. The latter approach, however, is the only 
sensible way to be used by future Internet, because from the 
resource point of view, it is the most economical, and 
serving a maximum number of users. 

35 Thus, it would be highly desirable to provide a system and 
method that allows all three major commercialization con- 
ditions to be satisfied. 

There are a number of publications and patents in the area 
of QoS-driven resource management. The most of the work 

40 has been focused on either the network, as described in U.S. 
Pat. No. 5,388,097 issued Feb. 7, 1995 to Baugher, M. J. et 
al., and entitled "System and Method for Bandwidth Res- 
ervation for Multimedia Traffic in Communication 
Networks/' and U.S. Pat. No. 5,581,703 issued Dec. 3, 1996 

45 to Baugher, M. J. et al, and entitled "Method and Apparatus 
for Reserving System Resources to assure Quality of Ser- 
vice"; or, the operating system, such as described in the 
reference "An Architecture Towards Efficient Os Support for 
Distributed Multimedia", Proceedings of IS&T/SPIE Mul- 

50 timedia Computing and Networking Conference '96, San 
Jose, Calif., January 1996 by David K. Y. Yau and Simon S. 
Lam. With the proliferation of multimedia services on 
Internet, it was soon realized that while IP networks were 
able to provide a simple, best-effort delivery service, the IP 

55 protocol is not suited for use with new real-time 
applications, such as multimedia streaming, Virtual Reality 
applications, distributed supercomputing. As a result, new 
network protocols, such as Resource Reservation Setup 
Protocol (RSVP) (See, e.g., "The Grid: Blueprint for a New 

60 Computing Infrastructure," Edited by Ian Foster and Carl 
Kesselman, Chapter 19, pp. 379-503, Morgan Kauffman 
Publishers, 1999); Real Time Transport Protocol (RTP); 
Real Time Transport Control Protocol (RTCP) and others, 
were developed (See, e.g., William Stallings, "High-Speed 

65 Networks: TCP/IP and ATM Design Principles", Prentice 
Hall, 1997; and, I. Busse, B. Deffner, and H. Schulzrinne, 
"Dynamic QoS Control of Multimedia Applications based 
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on RTP", Computer Communications, January 1996), Columbia University such as described in the reference 

enabling applications to request and negotiate network QoS entitled "Building Open Programmable Multimedia 

parameters, such as bandwidth and latency. Deployment of Networks 5 *, Computer Communications Journal, Vol. 21, 

those protocols on the current Internet has not been No. 8, pp. 758-770, June 1998 by Campbell, A. T.,Lazar, A. 

successful, firstly because it required upgrading all the 5 A., Schulzrinne, H. And R. Stadler; OMEGA end-point 

non-RSVP routers and servers system software. Secondly, architecture, developed as the interdisciplinary research 

even if RSVP were deployed on the current Internet, very effort in the University of Pennsylvania such as described in 

limited bandwidth and computing resources would still have the reference entitled "Design, Implementation and Experi- 

been the bottleneck for successful deployment of real-time ences of the OMEGA End-Point Architecture", Technical 

applications. The current Internet was built on the backbone, 1Q Report (MS-CIS-95-22), University of Pennsylvania, May 

enabling cross-country communications on relatively 1995 by Nahrstedt K. And J. Smith; in-serv Architecture 

unclogged 13 (45 megabit per second). Proliferation of which is a contribution of the Internet Engineering Task 

graphic pages, and streaming audio and video applications Force (IETF) such as described in the reference entitled "A 

depleted those resources quite fast. Even worse, the rate of Framework for End-to-End QoS Combining RSVP/Intserv 

user's population growth is considerably higher than newly 5 and Differentiated Services," Internet Draft, IETF, March 

build network resources. 1998 by Bemet Y, et al.; the Quality of Service Architecture 

The National Science Foundation and MQ Corporation, QoS-A, developed by A. Campbell, and presenting an inte- 

responding to the emerging needs of Internet community has grated framework dealing with end-to-end QoS require- 

been building a new network, called vBNS (very -high- ments such as described in the reference entitled "A Quality 

performance Backbone Network Service). This nationwide 20 of Service Architecture", PhD thesis, Lancaster University, 

network also provides a backbone for the two foundations, January 1996 by Andrew T Campbell. Another reference 

university-led effort called Internet 2 and by federal research which analyzes the above mentioned QoS paper is entitled 

agencies, called New Generation Internet. The vBNS allows "A Survey of QoS Architectures", ACM/Springer Verlag, 

most of the connected institutions to run at 622 million bits Multimedia Systems Journal, Special Issue on QoS 

per second (OC12). By the year 2000, vBNS is expected to M Architecture, Vol. 6, No. 3, pp. 138-151, May 1998 by 

operate at 2.4 gigabits per second (2,400 megabits per Aurrecoechea, C, Campbell, A. T. and L. Hauw. 

second) by the year 2000. Substantial work has been done by SRI International, 

The vBNS system exploits RSVP protocol to support two developing an End-to-End Resource Management of Dis- 

distinct classes of services: a Reserved Bandwidth Service, tributed Systems (ERDoS), which enables adaptive, end-to - 

i.e. a service with bandwidth commitment, and a traditional 30 end, scalable resource management of distributed systems 

best-effort IP service (See, e.g., Chuck Song, Laura Cun- such as described in the reference ERDOS QoS 

ningham and Rick Wilder, "Quality of Service Development Architecture, Technical Report, SRI International, May 

in the vBNS", MCI Communications Corporation, provided 1998. An extensible Resource Specification Language 

at the URL http://www.vbns.net/presentations/papers/ (RSL) and the resource management architecture has been 

QoSDev/ieeeqos.htm. Still, resource management at the 35 implemented within Globus meta-computing toolkit, and 

network layer for vBNS is done separately from operating used to implement a variety of different resource manage- 

system layer and in isolation from application needs and ment strategies such as described in Czajkowski, K., et al., 

availability of the end-resources, such as storage and com- "A Resource Management Architecture for Metacomputing 

puting resources. Systems" Proc. IPPS/SPDP '98 Workshop on Job Schedul- 

A new breed of high performance applications such as 40 i°g Strategies for Parallel Processing, 1998; and Foster, I., 

remote surgery, robotics, tele-instrumentation, automated Kesselman, C, "The Globus Product: A Status Report" Proc. 

crisis response, digital libraries of satellite data, distance IPPS/SPDP '98 Heterogeneous Computing Workshop, pp. 

learning via multimedia supported Web sites, enhanced 4-18, 1998. 

audio, and video, is emerging. However, to accommodate While the architectures described in the above-mentioned 

such high performance applications and their continuous 45 references are directed resource reservation and manage- 

media flows, it is not enough to increase or reserve network ment of end-to-end resources, they generally assume a 

capacity. These new applications require end-to-end single, even geographically limited network subsystem 

resource reservation and admission control, followed by which provides bounds on delay, errors and meet bandwidth 

co-ordination of distributed functions such as: (a) resource demands, and an operating system which is capable of 

scheduling (e.g., CPU, disk, etc.) at the end-system(s), (b) 50 providing run time QoS guarantees. However, the next 

packet scheduling and flow control in the network, and (c) generation Internet must be viewed not as only a network of 

monitoringofthedeHveredend-to-endquahtyofservice.lt networks, but first and foremost a system of distributed 

is essential that quality of service is configurable, predict- systems. In this paradigm, not only the communication 

able and maintainable system-wide, including the end- resources, but also the computing and storage servers are 

system devices, communications subsystem, and networks. 55 shared among many users. 

Furthermore, all end-to-end elements of distributed systems Thus, the architectures mentioned above do not provide a 

architecture must work in unison to achieve the desired coordinated management of overall system resources as a 

application level behavior. function of request activities for individual content and 

Up do date, there has been considerable effort in the computing resources. Consequently, quality of service must 

development of end-to-end quality of service support. 60 be degraded in response to growing volume of requests for 

Among them are Heidelberg QoS Model, developed within such services over and above an established limit. As the 

HeiProject at IBM's European Networking Center and above-mentioned architectures focus on providing QoS as 

described in the reference entitled "HeiRAT — Quality of requested by application, they do not take an advantage of 

Service Management for Distributed Multimedia Systems", a possible aggregation of resources due to commonality 

Multimedia Systems Journal, 1996 by Volg, C, Wolf, L., 65 between user requests for a particular service. For example, 

Herrtwich, R. And H. Wittig; an Extended Integrated Ref- it would be desirable to determine commonality for the 

ere nee Model (XRM), developed by COMET group at usage history of a particular multimedia content, e.g., bursts 
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of requests within short time intervals, the proximity of 
origination addresses of requests, etc. In addition, the archi- 
tectures described above do not allow for dynamic moni- 
toring and recording of resource consumption for individual 
services as well as for groups of related services, with the 
purpose of calculating cost of service for individual clients. 

Thus, it would be highly desirable to provide a mecha- 
nism capable of providing an adaptive resource management 
function for distributed resources that is suited for the next 
generation of Internet. 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a system 
and method for managing multimedia content and server- 
side resources that exploits the unique characteristics of 
future computer networked environments. 

Particularly, it is an object of the invention to provide a 
system and method for managing and controlling the 
distribution, sharing and pooling of resources in an Internet/ 
World Wide Web environment in such a manner that is 
beneficial, accountable, and seamless to the users who are 
requesting access to multimedia content. 

It is a further objective of the invention to provide a 
resource management system that self -regulates, according 
to policies, a plurality of end resources and server-based 
capabilities. 

The present invention is directed to a self-regulated 
distributed network of system resources, i.e., a "resource of 
resources". Particularly, a distributed system referred to as a 
Service Control Plane (SCP) performs self-regiilfltinp man- 
age ment of distributed resources comprising : a plurality of 
ena resources sucn as (but not limited to) servers and 
dataoases providing access to (multimedia) content and 
services , a plurality oi clients requiring access to content and 
s ervices, ana a plurality of system resources connected by a 
computer networ King environment. 

>^ccoraing to the principles ot the invention, the service 
control plane is an intelligent intermediary decision plane 
for managing and enforcing end-to-end properties in the 
availability of end resources in a computer networked envi- 
ronment. The service control plane receives stimuli (i.e., 
application requests) from a plurality of clients desiring to 
access target multimedia content, for instance, and responds 
to these with end-to-end mapping recommendations satis- 
fying one or more properties (policies) set in the service 
control plane. When a client accepts a mapping 
recommendation, the service control plane guarantees its 
performance as set forth by the policies used to generate it. 

The SCP essentially provides the control and management 
of the server-side (end) resources which may be character- 
ized as global, i.e., resources dynamically (on-demand) 
allocated by the system and having a lifetime determined by 
the system; and local, i.e., resources pre-allocated and 
presented to the system as possessing a lifetime that is 
independent of the system. As in any distributed object 
scheme, the local resource is a resource that is associated 
and maintained by its originating private provider. A global 
resource represents an on-demand created resource that is 
made available at one or more global system providers. 

Particularly, the system administrators control the place- 
ment of object replicas onto global resources, as well as the 
ratio of global to local resources for each particular server. 
The placement of replicas, and ratio of global to local 
resources may be constant, or changed based on preferences 
of content owners or service provider. 

Global resources may be made available as a function of 
integrated demand from clients. To satisfy the requests of 
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clients requesting access to target content, global server-side 
resources are allocated. A plurality of intermediary system 
resources (such as directories and resource monitors) enable 
the management of such requests for target content in a 

5 process referred to as mapping, brokering and negotiation. 
According to one aspect of the disclosure, the capacity of 
global resources is dynamic and performed when deemed 
necessary based on the analysis of utilization patterns of 
target content and replicas by pluralities of clients. The 

10 system dynamically shapes capacity by (a) controlling the 
number of replicas associated with an object and (b) the 
placement of these replicas. 

According to another aspect of the invention, the SCP 
additionally controls the mapping of the client requests. 

15 Particularly, methods are provided to dynamically shape 
demand for an object based on criteria such as arrival time, 
incoming geography, and costs requirements. In particular, 
the present invention characterizes future demand for an 
object based on aggregation and forecasting of past demand 

20 for such object. This aspect of the invention is described in 
conjunction with the capacity shaping mechanism. The SCP 
coordinates the dynamic placement of replicas of target 
content across global resources. 

Still another aspect of the invention relates to manage- 

25 ment of the end-resources. A skilled artisan will appreciate 
that today resource management is not feasible in part 
because interfaces to end systems are closed and proprietary. 
According to the system and method of the invention, 
interfaces and mechanisms are provided mat facilitate 

30 resource monitoring, setup, and management. 

To make such resource management a possibility, the SCP 
allows an end-resource to be both globally and locally 
managed with global components managed by the system, 

35 and local components self-managed by the resource. 

Herein a resource is referred to as a server-based capa- 
bility. In the present invention, a server is represented in 
terms of capabilities which represents a heterogeneous 
media unit streaming capacity. Each capability may be 

40 assigned to local or global management, with the service 
control plane permitting the management of global capa- 
bilities while the server itself manages local capabilities. A 
server administrator allocates the ratio of global and local 
capabilities at a server. Resource capabilities need not be 

45 uniform. However, capabilities need to be normalized and 
consequendy, the resource is viewed as a set of storage and 
service bins that represent capabilities for provisioning 
content. A service bin models capabilities in a resource and 
comprises one slot for each different capability tuple. A 

50 server manages a storage bin which is used for each content. 
Content is associated with capabilities via the binding of a 
storage bin with a service bin. However, because not all 
content is identical, it is necessary to obtain a worst resource 
bound for content. Each server is assigned content that is 

55 local and/or replicated. Replicated content is globally main- 
tained while local content is referred to as original content. 

Binding a storage bin to a service bin forms a streaming 
connection — the binding being evaluated at a quality of 
service. The service bin is associated with resource require - 

60 ments needed to stream content in such resolution. Each 
content is associated with a resource profile. For constant bit 
rate, this is possible in terms of bandwidth at the expense of 
computation. For variable bit rate, it is necessary to obtain 
a performance envelope for the worst case. The resource 

65 profile is used to facilitate the evaluation of the binding. 
Advantageously, the system and method of the invention 
provides important enhancements to resource management 
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architectures not yet available today: 1) it provides a highly 
available distributed decision capability that may be used to 
achieve and enforce properties over the usage, as well as 
patterns of usage, of end-resources; 2) it creates a resource 
monitoring point with respect to the performance of multiple 
end resources and clients and their usage patterns so as to 
provide parameters on where, when, and how to satisfy a 
request; and 3) it provides an insertion point that can be used 
for dynamic insertion of both resources and content in a 
manner transparent to clients. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Further features, aspects and advantages of the apparatus 
and methods of the present invention will become better 
understood with regard to the following description, 
appended claims, and accompanying drawings where: 

FIG. 1 illustrates the current Internet as comprising a 
plurality of local networks interconnecting subsets of 
servers, and subsets of clients. 

FIG. 2 is an illustration depicting a single virtual network 
for interconnecting all the clients and servers of FIG. 1. 

FIG. 3 is an illustration depicting the provision of repli- 
cating links from a server (111) to servers (113) and (114), 
which must be initiated in advance, to provide continued 
support of growing demand for objects in the virtual net- 
work of FIG. 2. 

FIG. 4 illustrates the provision of a Service Control Plane 
middleware (400) according to the invention that is located 
between the clients and the server resources of the virtual 
network of FIG. 2. 

FIG. 5 illustrates servers (111, 112 and 113) as comprising 
both local resource storage shaded in dark-gray (501) and 
the global resource storage as shaded as light gray (502). 

FIG. 6 illustrates the SCP (400) as comprising two layers: 
a Service Management Layer and a System Management 
Layer according to the invention. 

FIG. 7(a) illustrates an example replica directory (666) 
including schema and data associated with the replica direc- 
tory. 

FIG. 7(b) illustrates an example server directory (656) 
including schema and data associated with the server direc- 
tory. 

FIG. 8 illustrates the distribution of object replicas over 
the territory of the United States, as determined by the 
System Management Layer of the SCP at a particular instant 
of time. 

FIG. 9 illustrates a graphical depiction of the capacity 
shaping mechanism according to the preferred embodiment 
of the invention. 

FIG. 10 is an illustration depicting the streaming of a first 
type of content consuming resources si, ml, cl and bl and 
a second type of content consuming resources s2, m2, c2 and 
b2. 

FIG. 11 illustrates an example scenario where a system 
administrator has configured all server resources onto two 
partitions, one local and one global. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENT 

FIG. 1 is a depiction of the current Internet as comprising 
a plurality of local networks (101, 102, etc.) interconnecting 
subsets of servers, (e.g. HI, U2, etc.) and subsets of clients, 
(e.g. 122, 123, etc.), usually positioned in local proximity 
from those servers. In turn, local networks are connected via 
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routers (e.g. 131, 132, etc.), to a backbone network 100. As 
a result, all clients and servers are interconnected into a set, 
where, abstracting of the security issues, all the clients may 
share all the servers. FIG. 2 is an illustration depicting a 

5 single virtual network 200 interconnecting all the clients 
(121, 122, 123 and 124) and servers (111, 112, 113, and 114). 
For exemplary purposes herein, the network (200) backbone 
resources and its characteristics, such as bandwidth, latency, 
etc., are explicitly regulated by an RSVP network, such as 
vBNS, (not shown) used by the Internet2 or New Generation 
Internet (NGI). 

The network resources, dedicated to any end system, like 
a specific client or server, are limited and may vary. For 
example, resources may provide extra computing power, 

15 such as supercomputing arrays, real-time encoders, such as 
encryption engines, etc., however, for exemplary purposes 
herein, resources include those associated with multimedia 
content servers. It is understood that the solutions provided 
herein may be applicable to any kind of resource. 

2Q Additionally, end-system resources may be heterogeneous, 
using different operating systems, architecture, processors, 
memory bandwidth, storage, etc. It is assumed that the client 
resources are sufficient for running single user applications, 
e.g. media streaming. Such clients may share server 

25 resources. For example, as shown in FIG. 2, clients (122) 
and (123) request server (112) to produce an individual 
stream (215) and (216), respectively, for each of them, and 
clients (121) and (122) share server (111) by. requesting 
streaming connections (201) and (202), respectively. It cli- 

30 ents (123) and (124) desire to request content from the server 
(111) (via dotted lines 203 and 204, respectively) such 
requests may be declined, if the server (111) does not have 
enough of a specific resource, or a combination of resources. 
In the same time, server (113) and (114) may be temporary 

35 under used, having an additional resource to satisfy requests 
from clients (123) and (124), but owning no correspondent 
content. If this content is replicated from the server (111) to 
servers (113), and (114), then requesting clients (123) and 
(124) may be re-addressed to servers (113) or (and) (114). 

4 0 However, two scenarios may govern this: 

First, assuming that the requested content already exists 
on the servers (113) and (114), the system of the invention 
enables tracking of the "hot" content and where it is located. 
In this case, the system automatically re-addresses client 

45 requests to other servers with the same "hot" content, but 
having sufficient resources and willingness to accept those 
requests. 

Second, if there are no more sufficient available resources 
on all the servers managed by the system, the requested 

50 content may be replicated from the server (111) to the 
servers (113) and (114) and made available to clients (123) 
and (124) via respective links (310) and (311), as shown in 
FIG. 3. As media content may be quite large, for example 1 
hour MPEG-2 stream requires on the order of t>l Gbytes of 

55 storage, the replication may take time substantially longer 
than clients (123) and (124) agree to wait. Hence, as shown 
in FIG. 3, the replicating links (301) and (302) must be 
initiated in advance, to continue the support of all the 
growing demand during the time interval that is sufficient for 

60 the replication of this particular content. 

The contemporary Internet streaming applications estab- 
lish individual flow between a client and a server, taking care 
of the QoS, security, encoding and all other issues on 
per-flow basis. In order to provide means to intervene into 

65 this process, a Service Control Plane (SCP) middleware is 
provided such as shown in FIG. 4. The SCP (400) is located 
between the clients (121, . . . ,124) and the server resources 
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(111, . . . ,114), interrupts client requests, finds their rate, available capacity on web servers by dynamically shaping 

density and proximity, determines what content is "hot" and both demand and capacity based on some criteria established 

predicts the distribution of the requests for such "hot" by the system policies (652). Those policies define set 

content. On the other hand, the SCP (400) monitors the criteria for exploration of the willingness of the servers to 

availability of the resources, maps the requests to the servers 5 provide services, negotiation policy between overall demand 

with available resources, predicts utilization of the end- and placement of replicas etc. Farmer details regarding the 

resources and if necessary, dynamically re-distributes the capacity shaping aspect of the invention may be found in 

content. Because the SCP enables borrowing of under used ™~£?™ d > ™^vf ™ ^iSl^^nB 

„ *v j f r ^ c - tl „ No. 09/335,273, entitled SYSTEM AND METHOD FOR 

resources from servers (113) and (114) for increasing the CAPACITY SHAPING IN AN INTERNET 

resources of the server (111), for example, the resources at 10 ENVm0NMENX now v s Pat No . 6 ,466,980 the whole 

the server HI grow sufficiently and on-time to satisfy ^ disclosure of which ^ i ncorporated by refer - 

expeditiously all the requests to the content on this server. ence as ^ m set forth herein 

The SCP also enables returning resources back to the M shQWQ QQ mQ 6 fl ^ (m) instantiates a re est 

lending servers when the rate of requests decreases. (602) wMch ifl mm invokes a negotiator ob ject (603) for 

The invention classifies server resources, which are avail- 35 locating and negotiating the request with the service object 

able for borrowing, as global, and other resources as local. (604), e.g., streaming content. 

Local resources are those resources that are persistent, i.e., As described in aforementioned herein incorporated 

the content using the local resources is not changed dynami- co-pending U.S. patent application Ser. No. 09/335,272 now 

cally by the system, but only by way of system administra- U.S. Pat. No. 6,463,454, the negotiations are based on user 

tors. Global resources are those resources that are used to 20 profiles, per-flow policies, and server and replica directories, 

dynamically replicate the content, in order to accommodate User profiles (611) characterize the client resources, such as 

the overall volume of requests, or demand. For example, processing power, memory, available bandwidth, etc., and 

FIG. 5 illustrates servers (111, 112 and 113) as comprising exemplify user preferences, such as preferable compression, 

both local resource storage shaded in dark-gray (501) and cost limits > interactivity level, e.g., the possibility to control 

the global resource storage as shaded as light gray (502). It ™ mc flow of content in time, previewing set-up, willingness to 

is understood that system administrators are empowered to acce P l ° r reject commercial advertisement, etc. The ne g0 - 

designate the ratio of global resource to local resource at a ob ^ («?) uscs tentatlVC ,/^ C r Cn ^ dC ^ f T 

server. For example, in FIG. 5, the server (114) is configured both a dircctor y ™* (66 ^ ( ^^ . * 

as completely global, while other servers (111 112 and 113) server c ^ s , ervice < see 7 (*»; *f d *f <f es 

are papally global, and partially local. 30 a query strategy to query the servers associated with those 

r J ° tentative placements. The resulting placement is provided 

According to the preferred embodiment of the invention, according to the exploration and negotiation per-flow poli- 
the SCP (400) has three major functions: 1) it controls and cics ExploratioQ policy defincs ^ query stratcgV) e .g. how 
shapes the distribution and placement of requests onto many shcmld be in parallel. The negotiation 
servers; 2) it controls and shapes the distribution and place- policy fa ^ tQ rcfine multiplc tentative placements and 
ment of replicas across servers according to some set enable choosing of a placement based on criteria including: 
criteria; and, 3) it is responsible for dynamically creating, swvcr resourcc availability, e.g., a server having a willing- 
destroying, and moving replicas across servers as deemed nesg „ greeo „ {&ciQT} and capacitV) e ^ a having 
necessary. a « n igh" available capacity. The combination of these two 

FIG. 6 illustrates the SCP (400) as essentially comprising 4Q criteria makes the risk of service rejection minimal over 

two layers: a Service Management Layer (600a) and a time, and facilitates the overall load balancing, than in case 

System Management Layer (600fc). Both layers enable a 0 f p l a cement to a "red" and low capacity server. Per-flow 

mapping transitivity of a received client request to an object policies allow customizing placement, and could either be 

replica, but constitute separate services. loaded during initialization, or on-demand. 

Specifically, the Service Management Layer (600a) pro- 45 As mentioned, negotiations involve optional per-flow 

vides management of the per-flow connection by receiving services, including, but not limited to: a security service 

requests for objects and mapping those requests to the (621), a billing and pricing service (622), a transcoding 

particular server(s) based on factors such as: aggregated service (623), and a streaming service (624). For instance, 

demand statistics, and willingness of the servers to provide the security services (621) may be added to provide user 

the requested services. A more detailed description of this 50 authentication and authorization as well as encryption/ 

aspect of the invention is provided in commonly-owned, decryption of data. The pricing service (622) enables media 

co-pending U.S. patent application Ser. No. 09/335,272, services to be offered at various costs and permit a wide 

entitled SYSTEM AND METHOD FOR INTEGRATED variety of pricing policies. The billing service (622) allows 

LOAD DISTRIBUTION AND RESOURCE MANAGE- the cost of the service to be charged to the service user. The 

MENT IN AN INTERNET ENVIRONMENT, now U.S. 55 transcoding service (623) may request a real-time compres- 

Pat. No. 6,463,454 the whole contents and disclosure of sion method, as requested by a profile. The streaming service 

which is incorporated by reference as if fully set forth (624) allows the media servers to conform to the various 

herein. The Service Management Layer (600a) further pro- functions, such as a provision to publish or update media, 

vides a mapping of a request to the object, taking in The security service (621), billing and pricing service (622), 

consideration various parameters, such as user preferences 6Q transcoding service (623), and streaming service (624) each 

embodied by user policies (611), client parameters, and also may plug-in to the service management layer (600a) via 

per-flow policies embodied as policies (612), such as a custom facades or interfaces 631, 632, 633 and 634, respec- 

policy for service delivery, including availability, quality, lively. It should be understood that the list of services is not 

interactivity level, etc. exhaustive, and others may be added by implementing 

The System Management Layer (600&) provides an 65 associated facades, 

aggregated demand and capacity management and control Within the System Management Layer (6005) the service 

by attempting to match predicted demand for web objects to objects (604) undergo another negotiation process (643) 
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which is directed to generating and distributing object 
replicas over available servers. As shown in FIG. 6, this 
negotiation process takes into account aggregated demand 
statistics (653), selects the set of "hot", or most popular 
objects according to system policies (652) and server 
resource profiles (651). For example, FIG. 8 illustrates the 
distribution of object replicas over the territory of the United 
States, as determined by the System Managemeat Layer 
(600b) at a particular instant of time. 



U.S. patent application Ser. No. 09/335,273 now U.S. Pat. 
No. 6,466,980. As shown in FIG. 9, the line (661) represents 
the number of request streams measured per pre-established 
time interval. The dotted line (662) symbolizes the number 
of possible streams per this interval. During the initialization 
process, as indicated at (663) before the time segment T(i), 
no transient (or global) replicas exist. However, during the 
T(i) interval, the demand exceeds capacity to a degree, 
which leads the System Management Layer (600b) to add 



, t , . „„ , t one transient replica to one of the global servers, which has 

As further shown m FIG. 6 the initial placement of H> ^ to accept me ^ pQSSess a 



15 



replicas onto servers (671), (672) and (673) is defined by the 
service provider and (or) content owner, and reflected in the 
replica directory (666) and server directory (656). The server 
brokers (605) reflect the willingness of the servers to accept 
the service, which is monitored and controlled locally by the 
server itself, and reported as utilization state indicated as 
Red, Yellow or Green states into the server directory (656) 
(see FIG. 7(6)). For example, if the local server (671) holds 
the replica which is requested by the client (601), and the 
condition of the server (671) is "Green," the Service man- 20 
agement layer may bind the client (601) and the replica on 
the server (671), establishing the streaming flow (680) 
between them. After the binding between the client and a 
server is established, the QoS control, interactivity control, 
etc., is further provided by the point-to-point application 25 
itself. However, if the per-flow QoS parameters for an 
individual request cannot be delivered, then the control 
returns back to the System Management Layer, which 
remaps the request to another server. 

It should be understood that the number of replicas 
associated with a given object is variable over time and 
driven by predicted demand for a web object. When the 
Service Management Layer (600a) places the request to the 
System Management Layer (600b), it may be discovered 
that the demand analysis for the replica of the particular 
object and the analysis of availability of capacity may be 
mismatched, e.g., as a result of a decrease or increase of 
demand for the object replica. If the mismatch is of a degree 
in accordance with a set criteria, then the Service Manage- 
ment Layer generates an under/over capacity audits (691) 
such as described in co-pending U.S. patent application Ser. 
No. 09/335,273 now U.S. Pat. No. 6,466,980. In response, 
the System Management Layer initiates a comprehensive 



"green" indicator of resource availability. At the end of the 
interval T(i+1) two transient replicas are added, due to the 
predicted high growth in demand, which generated an under 
capacity condition. However, the prediction is not realized 
during several next time intervals, and finally the demand 
falls to a level which generates an over capacity condition. 
If the replica expiration time shown on FIG. 9 is expired at 
the end of the interval T(N-1), then the System Management 
Layer (600ft) does not enable the replicas to extend their life 
on the related servers. As a result, all the transient replicas 
are destroyed, and the overall capacity will fall to the initial 
level. Finally, as shown in FIG. 9, the demand grows fast 
again, so at the end of the control interval T(N) three new 
transient replicas are added. 

According to the invention, server resource*; may be 
subdivided into two groups of resources: storage and service 
resources. In order to normalize capabilities of the hetero- 
geneous servers, the notion of storage and service bins 
respectively, is introduced. For example, resources required 
by a particular application, e.g., a streaming application , ' 
may be experimentally or theoretically e valuated b y a size of 
s torage s, required to store the content^ and a tuple (m. c, b) . 
where m , c and b symbolize memory, CPU cycle allocation 
(e lg., in million instructions per second (MIPSTK and band- 
W KfilTresources that are required to stream the content . 

FIG. 10 illustrates streaming of MPEG-1 content, e.g., 
consuming resources si (1001), ml (1002), cl (1003) and 
bl (1004). Streaming of another type of content, e.g., one 
4 0 coded in MPEG-2, may require s2 (1011), m2 (1012), c2 
(1013) and b2 (1014) resources. Tlms T as a convenient 



30 
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yieans of measurement of the consumption of resources by 
a vanous classes of applications, the resources are charac- 

, . , . „ , , , , , tenzed as storage s and service rtm.c.b) bi ns. Consequently. 

analysis, which could possibly lead to the creation and/or the resource is viewed asa set of storage and service bS. 
deletion of replica(s), effectively shaping capacity based on which represent : capabilities for provisioning servic'es^ S 
demand. not all applications are identical, it is necessary to obfain a 

As further shown in FIG, 6, via server broker objects worst rcs0 iirce bound for content, requiring experimental 
(605) which represent available servers, the negotiator measurement of different classes of applications for various 

50 servers. However, this approach allows convenient normal- 
ization and reservation of heterogeneous server resources by 



ru. -t» sxere. 



objects (643) finally maps the object replica onto one or 
several global servers, e.g. server (672). Simultaneously, the 
negotiator reports the new placement of the replica to the 
replica directory (656) in the service management layer. The 
mapping of a replica may involve an initiation of replication 
(682), or migration (683), or deletion process. The change in 
replica placement is reported back to the directory (656), 
effectively initiating a process (692) for incrementing/ 
decrementing capacity, i.e., adding/deleting the server 
address out of the replica directory (656). 

Note, that optional system services, such as replication 
(625), or aggregated cost control (626) may be exploited by 
the negotiator object (643) while deciding on the destination 
of replica and speeding up the replication process, 

FIG. 9 illustrates a graphical depiction of the capacity 
shaping mechanism according to the preferred embodiment 65 
of the invention. Further details of the capacity shaping 
mechanism may be found in above-mentioned co-pending 



55 



60 



various classes of applications. 

The relevant disclosure described in greater detail in 
found in commonly-owened, co-pending U.S. patent appli- 
cation Ser. No. 09/335,272 now U.S. Pat. No. 6,463,454 
describes the method of overall resource monitoring for the 
case when several applications run in parallel. However, it is 
assumed that service bins are additive. In other words, the 
overall service resource consumed by several applications 
that are running in parallel is equivalent to a sum of resource 
bins required by each application. The overall storage is 
always a sum of storage bins, unless the applications share 
the same content, and therefore require a common storage 
bin. 

A system administrator configures overall resources as 
local and global. The administrator is responsible for estab- 
lishing the ratio of local to global resources for each server 
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as well as to establishing policies of "green" "yellow** and behind its firewall. The delegate controller performs demand 
"red" load limits for those resources. After configuring a and capacity monitoring tasks for the content provider and 
partition as global storage, the global resource management when critical thresholds are exceeded, the delegate control- 
takes over the control of this resource. Thus, a global storage i cr places a replication request to the ISP's controller. The 
bin represents a partition that can only be reserved by the 5 jsp controller then arbitrates between pending replication 
global resource management provided by SCR Note, that the requests and determines which replication requests get allo- 
system administrator or the server itself, depending on a cated t0 wn icn global servers. In particular, the same global 
relevant policy, may re-claim the global resource in full or scrycr be sharcd across diffcrcnt dclegate contro u C rs. 
partially, by requesting its release from the SCP manage- . . . , 
ment. The native reservation management process (1020) as in Askllled artisan wul appreciate that content providers for 
shown in FIG. 10, is responsible for monitoring resource 10 secunt y reasons > ma y not be wlUm S t0 dlsclose nor aUow 
consumption, and determining the server willingness to another party to access its internal usage statistics. For this 
accept new requests. It is understood that, the native process reason, content providers may be enabled to directly request 
may discriminate between application requests for global to placement of a specific replica onto the ISP global 
and local content. Additionally, a request for placement of a servers. Furthermore, such request may be conditioned (by 
global replica may be declined or accepted, as controlled by 15 the content provider as opposed by the controller) to meet 
a pre-installed policy that enables maximization of the cost particular requirements such as geography or capacity of the 
of global resources that may be billed to a global resource tar 8 et g lobal server - The ISP controller may then try to locate 
requester. Such a policy may also depend on request statis- °ne such particular global server satisfying such require- 
tics monitored by the SCP. ments. For example, a particular content provider may place 
As mentioned, local storage resources are dedicated to 20 a replication request such as "get me a replica of the 
storing persistent replicas of content, while transient replicas following content within the next 5 minutes and locate such 
are replicated using the global storage. Further, the local transient «pJk» ™ a HIGH ca P acit V in the Southeast 
storage bin is always associated with one or more local °f trie USA . 

service bins, and a global storage bin is associated with one Moreover, according to the principles of the invention, 

or more global service bins. 25 ISPs may collaborate in agreements allowing an ISP to 

FIG. 11 illustrates an example scenario where a system handoff replication requests to other ISPs having suitable 

administrator has configured all server resources onto two gl° ba l servers. For example, in the event that none of an 

partitions, one local and one global. As shown in the ISP's global servers satisfies the requirements associated 

example scenario depicted in FIG. 11, two streaming appli- 30 with a request placed by a content provider, the ISP would 

cations share a transient content replica and consume one ton handoff the replication request to some other friendly 

global storage bin s2 (1102) and two service bins r2 (1103) E>P. 

and r3 (1104), while one application consumes a persistent Moreover, according to the principles of the invention, an 

content replica included in service storage bin si (1101). auction for the placement of such replication request may be 

Overall global storage resource consumed is thus equivalent 35 implemented by an intermediary party. Such intermediary 

to s2, while an overall global service resource is a sum of party may then negotiate the handoff such replication 

bins r2 and r3. request to the most capable or suitable party according to 

There are many useful applications of the present inven- some cost metric such as cost or an expiration deadline, 

tion. First, the current QoS approach of many QoS manage- A skilled artisan will appreciate that it is possible for a 

ment systems suggests decreasing of quality, if such an event 40 replica to be placed in a global server, which does no longer 

occurs. Contrary to this, according to the invention, a fit demand characteristics. To this end, the present invention 

customer will receive the quality which he/she agrees to pay augments the replica management system described in 

for. Customers will most probably refuse to buy the content related U.S. patent application Ser. No. 09/335,273 now 

without the ironclad service guarantee, and may even dis- U.S. Pat. No. 6,466,980 by introducing of periodical sanity 

agree with the reduction of quality even if the charges for the 45 checks, i.e., implementation of a process to review and 

service were reduced. The invention demonstrates how to enhance the overall placement of transient replicas on the 

provide a mutually agreed quality of service between the system, which is referred to herein referred to as replica 

client and the system with a high level of service guarantees. migration. 

Additionally, the present invention may be used as a [ n conjunction with the SCP, a final aspect of the inven- 

value-added service by a Internet Service Providers (ISP) to 50 tion is directed to a Resource Management Framework 

the content providers. It allows to support a broadcast (RMF) which is an extensible, distributed, policy-based, 

content providers (such as, for illustrative purposes only, a object-oriented management framework that supports per- 

cable TV News Network "NN") by dynamically matching fl ow me dia session setup and control between one or more 

demand to capacity of NN's servers. When necessary, the me di a servers and end users and may augment the SCP. This 

ISP would place transient replicas (of NN content) on its 55 extensible framework is described in greater detail in 

own global servers based on characteristics about the commonly-owned, co-pending U.S. patent application Ser. 

demand for NN content as presented to the ISP. n 0 . 09/335,275 which is currently pending, entitled 

The present invention may additionally be used to provide POLICY-BASED MULTIVARIATE APPLICATION- 

statistical sharing of ISP resources (i.e., globally-shared LEVEL QoS NEGOTIATION FOR MULTIMEDIA 

servers) when supporting multiple broadcast content pro- 60 SERVICES, the whole contents and disclosure of which is 

viders (such as for illustrative purposes only television and incorporated by reference as if fully set forth herein. The 

media networks). This way, the ISP controller may manage RMF framework comprises a mandatory core set of basic 

the allocation of replication requests according to some cost functions upon which further features can be added. Such 

metric such as profit or content provider maximum loss new extended functional compositions for the framework 

protecting most likely, the best interest of the ISP. 6 5 include the addition of user profiles and preferences, security 

In another implementation, each independent content (authentication and authorization), pricing and billing, 

provider runs a server referred to as a delegate controller reservations, resource monitoring and others. In addition, 
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the policies, upon which management decisions and service 
selections are based, are >phiggable-and new or updated 
policies may be added, to augment or replace existing policy 
and allow new behavior or refinement of existing behavior. 

The RMF is designed to augment the SCP to facilitate the 
management of the heterogeneous media streaming systems. 
The low-level network connections, streaming and media- 
control remain the domain of the media streaming systems. 
RMF hooks into those lower levels and provides a higher 



according to said predetermined criteria, wherein per- 
formance for placing requests of said first management 
means has an input of and considers said capacity 
shaping provided by said second management means. 

2. The system as claimed in claim 1, wherein said client 
resources includes one or more of processing power* 
memory, an available bandwidth for receiving said object 
content at a client resource. 

3. The system as claimed in claim 1, wherein said user 



level management and control of those services. For 10 preferences includes preferable compression of said content, 

example, the RMF includes QoS mapping down to media cost limitations for said content, and interactivity level 

level services based on application level requests, but defers including real-time interaction with flow of said object 

the setup of the network connections to the SCP. Where resource content. 

media streaming may be adaptively rate controlled and 4. The system as claimed in claim 1, wherein said criteria 

managed by the media server, within the bounds permitted 15 includes willingness of said servers to provide said 

by the QoS specification, then the media server can act requested services, and available capacity of said server for 

autonomously. When it is unable to maintain the required providing a streaming resource. 

QoS level then an exception is raised to the SCP, which may 5. The system as claimed in claim 1, further including 

then, according to contact policies involved, attempt to facade means for interfacing per-flow services with said first 

expand or migrate the end-resources, and then re -map the 20 management means, said facade means enabling dynamic 

clients requests, allowing them to receive services else- insertion of both resources and content in a manner trans- 

where. parent to clients. 

While the invention has been particularly shown and 6. A system for regulating server resources in a distributed 

described with respect to illustrative and preformed embodi- wide area network comprising servers interconnected by a 

ments thereof, it will be understood by those skilled in the 25 computer networking environment, said resources including 

art that the foregoing and other changes in form and details multimedia object resources and resources for storing and 

may be made therein without departing from the spirit and streaming said multimedia objects to requesting clients, the 

scope of the invention which should be limited only by the system comprising: 



scope of the appended claims. 

Having thus described our invention, what we claim as 30 
new, and desire to secure by letters patent is: 

1. A system for regulating server resources in a d istributed. 



*-IAa 11 *vUldLui w ^ c arca network comprising servers in terconnected by a 
*TJVtU>-U riMtc**. computer" networking environment, said resources i ncluding 
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multime dia objec t resources and resources for storing and 
streaming said multimedia objects to r equesting c lients, the 
system comprising: 

firof «^rY ; ™ ""ariai"*™^ r"" Qrtc implementing objects for 
r eceiving client reques ts for streaming object resources 
from a server to said^client, generating statistics asso- 
ciated with demand for said object resources, and 
automatically mapping said requests to available serv- 
ers in said network based on predetermined criteria, 
including user profiles for characterizing client 
resources and user preferences relating to receipt of 45 
said object resource, and per-flow services relating to 
one or more selected from the group comprising, 
transcoding object content, offered pricing for said 
content, billing, authorizing and authenticating users, 
and encrypting/decrypting data content, and wherein 
said first management, means__f urther includes one or 
more first negotiator objects for locating a service 
object in said distributed network capable of handlin g 
s aid requestaccording to said predetermined criteria , 
generating and placing inquiries related to said received 55 
requests, and negotiating requests with a service object; 
and, 

second system management means implementing objects 
^for automatically shaping capacity of object resources 
at said servers in said distributed network based on said 
demand statistics, current available capacity of said 
servers and a willingness factor associated with each 
server for providing said requested resource, said sec- 
ond management including means for automatically 
replicating an object associated with said objects 65 
resources and temporarily placing said object replicas 
at one or more server devices having available capacity 
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first service management means implementing objects for 
receiving client requests for streaming object resources 
from a server to said client, generating statistics asso- 
ciated with demand for said object resources, and 
automatically mapping said requests to available serv- 
ers in said network based on predetermined criteria, 
wherein said first management means further includes 
one or more first negotiator objects for locating a 
service object in said distributed network capable of 
handling said request according to said predetermined 
criteria, generating and placing inquiries related to said 
received requests, and negotiating requests with a ser- 
vice object, wherein first negotiator objects invoke a 
first directory service means for locating said service 
object, said first directory service means comprising a 
mapping of service objects to locations of associated 
object replicas at said servers, and said first negotiator 
objects invoke a second directory service for mapping 
available distributed servers with an available capacity 
rating, wherein said second directory service further 
maps a willingness indicator to an associated server, 
said willingness indicator indicating degree of willing- 
ness of a server to receive placement inquiries; 

second system management means implementing objects 
for automatically shaping capacity of object resources 
at said servers in said distributed network based on said 
demand statistics, current available capacity of said 
servers and a willingness factor associated with each 
server for proving said requested resource, said second 
management including means for automatically repli- 
cating an object associated with said objects resources 
and temporarily placing said object replicas at one or 
more server devices having available capacity accord- 
ing to said predetermined criteria, wherein performance 
for placing requests of said first management means has 
an input of and considers said capacity shaping pro- 
vided by said second management means wherein said 
second management means includes one or more ser- 
vice broker objects representing available servers in 
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said network, said service broker objects interacting generating and placing inquiries related to said received 

with said second negotiator objects for placing gener- requests, and 

ated object replicas associated with an object resource . . A . 

to said global servers throughout said distributed net- negotiating requests with a service object, which step of 

work. 5 negotiating requests includes the step of negotiating for 

7. The system as claimed in claim 6, wherein said second per-flow services and implementing one or more 
management means includes means for generating object facades for dynamically inserting optional including 
replicas associated with object resources for placement at both resources and content in a manner transparent to 
global servers throughout said distributed network, said clients, said per-flow services including transcoding 
second management means further including one or more 10 conte nt, offered pricing for said content, billing, 
second negotiator objects for distributing object replicas autbo rizing and authenticating users, and encrypting/ 
over available servers. - . , 

8. The system as claimed in claim 7, wherein said second decrypting data content; 

management means generates object replicas on demand. automatically shaping capacity of object resources at said 

9. The system as claimed in claim 7, wherein a number of 15 servers in said distributed network based on said 
generated object replicas associated with an object resource demand statistics, current available capacity of said 
is determined in accordance with predicted demand for a servers and a willingness factor associated with each 
given object, server capable of providing said requested resource; 

10. The system as claimed in claim 1, wherein said servers 

in said network are normalized as comprising a storage bin 20 automatically replicating an object associated with said 

of a size for storing object resource content, said second objects resources and temporarily placing said object 

management means managing placement of content across replicas at one or more server devices having available 

storage bins so that access optimizations may be made. capacity according to said predetermined criteria, 

11. The system as claimed in claim 10, wherein said wherein performance for placing requests for streaming 
servers in said network are normalized by representing a 2S object resources has an input of and considers said 
server as comprising an associated service bin characterized capacity shaping step- and 

as a tuple comprising memory, CPU allocation and stream- 
ing bandwidth required for provisioning services. characterizing client resources according to user profiles 

12. The system as claimed in claim 10, wherein a object use * preferences relating to receipt of said object 
resource streaming connection between a server and a client 30 resource, and implementing said user profiles and user 
is instantiated by binding a storage bin of an associated preferences as said predetermined criteria. 

server to its service bin while taking into account one or 19, method as claimed in claim 18, wherein said step 

more quality of service parameters. of locating a service object includes referencing a first 

13. The system as claimed in claim 12, wherein a stream- „ a „,;„ a „ n • . . 1 4 . „ * 

r . . . * directory service tor mapping service objects to locations 01 

mg connection is instantiated by binding a storage bin of an 35 . t , 4 * 

• * j * - /• t_-i „ , * • , * associated object replicas at said servers, 

associated server to its service bin while taking into account J r 

one or more quality of service parameters. 20 - ^ method as claimed in claim 19, further including 

14. The system as claimed in claim 10, wherein said referencing a second directory service for mapping available 
second management means manages placement of content distributed servers with an available capacity rating, 
across storage bins according to geography of demand. 40 21. The method as claimed in claim 20, further including 

15. The system as claimed in claim 14, wherein said referencing a second directory service for mapping available 
second management means manages placement of content distributed servers with a willingness indicator indicating a 
across storage bins in a manner so as to support peak d of wi]lin ness of a server t0 receive pi acem ent 
demand. . . . 

16. The system as claimed in claim 14, further including 45 mc l uiries * 

a signaling protocol clients and content servers enabling 22 * ^ as claimed in claim 18, wherein said step 

control servers to monitor the placement of requests from of automatically replicating an object associated with said 

clients onto said servers. objects resources and temporarily placing said object repli- 

17. The system as claimed in claim 1, wherein the wide cas at one or more server devices includes implementing or 
area network comprises the Internet. 50 more second negotiator objects for distributing object rep- 

18. A method for regulating server resources in a distrib- ii cas over available servers 

uted wide area network comprising servers interconnected ^ ^ method as c]aimed ^ daim lfi wherdn said 

by a computer networking environment, said resources c . «. . , j , , c j 

. 1 j. h-j- i- , « c of gener atrng statistics associated with demand for said 

including multimedia object resources and resources for & 6 . 

storing and streaming said multimedia objects to requesting 5S object resources includes: 

clients, the method comprising: predicting a demand for a given object resource; and, 

providing objects for receiving client requests for stream- determining a number of generated object replicas asso- 

ing object resources from a server to said client, gen- dated with gaid demaQd 

eratmg statistics associated with demand for said object ™ \ . 

resources, and automatically mapping said requests to 6 o M * ^ method 35 clauned m claim 22 ' farther mcludm g 

available servers in said network based on predeter- providing one or more service broker objects representing 

mined criteria, wherein said step of automatically map- available servers in said network for interaction with said 

ping requests further includes steps of, second negotiator objects to place generated object replicas 

implementing one or more first negotiator objects for associated with an object resource to said global servers 

locating a service object in said distributed network 65 throughout said distributed network, 

capable of handling said request according to said 25. The method as claimed in claim 18, further including 

predetermined criteria, characterizing said servers in said network as comprising a 



04/30/2004, EAST Version: 1.4.1 



US 6,516,350 Bl 



19 



storage bin of a size for storing object resource content, and 
managing placement of content across storage bins. 

26. The method as claimed in claim 25, further including 
characterizing said servers in said network as comprising an 
associated service bin including a tuple comprising memory, 
CPU allocation and streaming bandwidth required for pro- 
visioning services. 

27. The method as claimed in claim 26, further including 
the step of instantiating an object resource streaming con- 
nection between a server and a client by binding a storage 
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bin of an associated server to its service bin while taking into 
account one or more quality of service parameters. 

28. The method as claims in claim 18, further including 
implementing a signaling protocol between clients and con- 
tent servers for monitoring the placement of requests from 
clients onto said servers. 

29. The method as claimed in claim 18, wherein the wide 
area network comprises the Internet. 
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